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Abstract. We use the distances introduced in a previous joint paper to exhibit the gradient 
flow structure of some drift- diffusion equations for a wide class of entropy functionals. Functional 
inequalities obtained by the comparison of the entropy with the entropy production functional reflect 
the contraction properties of the flow. Our approach provides a unified framework for the study of 
C " ^ , the Kolmogorov-Fokker-Planck (KFP) equation. 
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1. Setting of the problem. Our starting point concerns nonnegative solutions 
. with finite mass of the heat equation in R'' 

<C ; dtut = Aut . (1.1) 



(N 



It is straightforward to check that for any smooth enough solution of (jl.ip and any 



2 , convex function t/j, 

^ I i^{ut)dx^- [ i;"{ut)\Dut\''dx 

SO that /jjd ip{ut) dx plays the role of a Lyapunov functional. To extract some informa- 
tion out of such an identity, one needs to analyze the relation between J^^ V'(''^t) dx and 
Xf^ ! /ftd V'"(wt) l-DwtP dx. This can be done using Green's function or moment estimates, 

. with the drawback that these quantities are explicitly t-dependent. It is simpler to 

ly-j ' rewrite the equation in self-similar variables and replace by the Fokker-Planck 

O ' (FP) equation 

dtvt = Avt + \/ -ixv) . (1.2) 

This can be done without changing the initial data by the time-dependent change of 
^ , variables 

c5 ' 1 

..(.)-^^.(^), Rit)-VTT2t. 

We shall restrict our approach to nonnegative initial data uq = vq. By linearity, we 
can further assume that 

vtdx — Uf dx — Uodx — 1 



*Ceremade (UMR CNRS no. 7534), Universite Paris-Dauphine, place de Lattre de Tassigny, 
75775 Paris Cedex 16, France. E-mail: dolbeauKSceremade. dauphine.fr 

tCeremade (UMR CNRS no. 7534), Universite Paris-Dauphine, place de Lattre de Tassigny, 
75775 Paris Cedex 16, France. E-mail: nazaretaceremade.dauphine.fr 

Universita degli studi di Pavia, Dipartimcnto di Matcmatica "F. Casorati", Via Ferrata 1, 27100, 
Pavia, Italy. E-mail: giuseppe.savare@unipv.it 



without loss of generality. We shall also assume that is defined on R+ . Up to the 
change of ip into -0 such that ■i/;(s) = i}{s) — V'(l) — — 1), we can also assume 

that i/i is nonnegative on R''" and achieves its minimum value, zero, at s = 1. 

Eq. (jl.2l) has a unique nonnegative stationary solution w = 7 normalized such 
that Jjgd 7 = 1 , namely 

e-l^lV2 

If we introduce pt ~ 'i't/7, then pt is a solution of the Ornstein-Uhlenbeck, or 
Kolmogorov-Fokker-Planck (KFP), equation 

dtpt = Apt - X ■ Dpt (1.3) 

with initial data po = vq/j. After identifying 7 with the measure jJtf^, the relevant 
Lyapunov functional, or entropy, is J^^ V'(Pt) and 

^ / ^(p,)rf7 = - / i^"{pt)\Dpt\'dj . 

We shall restrict our study to a class of functions ip for which the entropy and the 
entropy production functional are related by the inequality 

2A / i^{p)dj< [ r{p)\Dp\'dj (1.4) 



for some A > (it turns out that in the case of the Gaussian measure we can choose 
A = 1). This allows us to prove that the entropy is exponentially decaying, namely 



4'ipt)dj< i^J^Jipo)djj e-'^' Vt>0, (1.5) 

if Pt is a solution of (|1.3p and if A is positive. A sufficient condition for such an 
inequality is that 

the function h := l/tp" is concave (1-6) 

(see for instance [8]). At first sight, this may look like a technical condition but it 
has some deep implications. We are indeed interested in exhibiting a gradient flow 
structure for (|1.2p associated with the entropy or, to be more precise, to establish 
that, for some distance, the gradient flow of the entropy is actually (|1.2p . It turns out 
that (11.61) is the natural condition as we shall see in Section 13.21 



The entropy decays exponentially according to (II. 5p not only when one considers 
the L^(M'*) norm (the norm of the square integrable functions with respect to the 
Gaussian measure 7), i.e. the case V'(p) = (p ^ 1)^/2, or the classical entropy built 
on i/)(p) = plogp, for which ()1.3p is the gradient flow with respect to the usual 
Wasserstein distance (according to the seminal paper [24] of Jordan, Kinderlehrer 
and Otto). We also have an exponential decay result of any entropy generated by 

(2 — a)(l — a) 

2 



and more generally any ip satisfying (jl.6p . Notice by the way that 'il'{p) = i^aif) is 
compatible with (I1.6P if and only if a e [0, 1) and that V'(p) = P logp appears as the 
limit case when a — > 1_. 

The exponential decay is a striking property which raises the issue of the hid- 
den mathematical structure, a question asked long ago by F. Poupaud. As already 
mentionned, the answer lies in the gradient flow interpretation and the construction 
of the appropriate distances. Such distances, based on an action functional related 
to V': have been studied in [STj. Our purpose is to exploit this action functional for 
the construction of gradient flows, not only in the case corresponding to (|1.3|) but also 
for KFP equations based on general A-convex potentials V. For the convenience of 
the reader, the main steps of the strategy have been collected in Section [2J without 
technical details (for instance on the measure theoretic aspects of our approach). 

Coming back to our basic example, namely the solution of (|1.3p , we may observe 
that a solution can easily be represented using the Green kernel of the heat equation 
and our time-dependent change of variables. If ipip) = ipaip), ol € [0,1), we may 
observe that the exponential decay of the entropy can be obtained using the known 
properties of the heat flow and the homogeneity of "00 , while the contraction properties 
of the heat flow measured in the framework of the weighted Wasserstein distances 
introduced in pT] can be translated into the exponential decay of the distance of the 
solution of (11.31) to the gaussian measure 7, if we assume that p7 is a probability 
measure. We shall however not pursue in this direction as it is very specific of the 
potential Vix) — \ |xp and of the heat flow (for which an explicit Green function is 
available) . 

Let us conclude this introductory section by a brief review of the literature on the 
functional inequalities based on entropies such that (|1.6I) holds. Such functionals are 
sometimes called ip-entropies. In this paper, we shall however avoid this denomination 
to prevent from possible confusions with the function (j) and the functional $ used 
below to define the action and the weighted Wasserstein distances Wh- 

We shall refer to (THl US] for a probabilistic point of view. A proof of p.5p under 
Assumption (|1.6p and an hypothesis of convexity of V can be found for instance 
in [S] or in the more recent paper [15]. This approach is based on the Bakry-Emery 
method [HIUHI and heavily relies on the flow of KFP or, equivalently, on the geometric 
properties of the Ornstein-Uhlenbeck operator (using the carre du champ: see |15|V 
Strict convexity of the potential is usually required, but can be removed afterwards 
by various methods: see [HlIiniEDj- For capacity-measure approaches of (|1.4p . we 
shall refer to [TTJ [121 US] • The inequality (|1.4|) itself has been introduced in ^3j with 
a proof based on the hypercontractivity of the heat flow and spectral estimates, and 
later refined and adapted to general potentials in 

Concerning gradient fiows and distances of Wasserstein type, there has been a 
huge activity over the last years. We can refer to PH HI] for fundamental ideas, and 
to two books, [3j[27|, for a large overview of the field. Many other contributions in 
this area will be quoted whenever needed in the proofs. 

2. Formal point of view: definitions, strategy and main results. In Sec- 
tion [1] we have considered the case of the harmonic potential V{x) — ^ We 
generalize the setting to any smooth, convex potential V : R'^ — >■ R with 

DV>AI, A>0, (2.1) 
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and consider the reference measure 7 given by 

-f:=e-^^'^ (2.2) 
where denotes Lebesgues's measure on E*^. We assume that 



7(M'') = / e-^ dx =: Z < 00 . (2.3) 
Next we define the action density (j) : (0, 00) X M'' ^ M as 

(l)ip,w) := g{p) \w\ 



,2_ 



h{p) 

for some concave, positive, non decreasing function h with sublinear growth. The 
function g is therefore convex and also satisfies the condition 

2(5')' < 5 5". (2.4) 

Our main example is h{p) := p" for some a G (0, 1). Based on the action density, we 
can define the action functional by 

^p,w) / q^ip,w)dj. (2.5) 



The Kolmogorov-Fokker-Planck (KFP) equation. With the notations := 
A — DV ■ D, the equation 

dtpt - A^Pt = (2.6) 

determines the Kolmogorov-Fokker-Planck (KFP) flow St : po ^ pt- Its first vari- 
ation, Rt : Wq I— > Wt, can be obtained as the solution of the modified Kolmogorov- 
Fokker-Planck equation 

dtWt - A^wt + D'^Vwt = . 

If Wq = DpQ, then Wt = Dpt, which can be summarized by 

D{StPo) = Rt{Dpo) . 

By duality, using the notations Vy-w := V • ii; — DV ■ w and V • ti; := J2^=i dwi/dxi, 
if ■ Wq = pq, we also find that V.y ■ Wt ~ pt, which amounts to 

■ {RtWQ) ^ St{^^ ■ Wq) (2.7) 

(see Theorem 15.41 for details). If /i = p7, we define the semigroup §f acting on 
measures by §i/i := {Stp) 7. 

Consider an entropy density function ij) such that "0(1) = V''(l) = 0- If we define 
the entropy functional by 
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and the entropy production, or generalized Fisher information functional, as the action 
functional for the particular choice w = Dp, i.e. 

P{p) -.^HP.Dp) , 

then, along the KFP flow, we get 

(pt) = -P{pt) = -1>(pt, Dpt) (2.8) 



for a solution pt of (|2.6p if 

tp" = g . 

Notice that (II. 6p and (|2.4I) are equivalent. See Section for more details. The main 
estimate for this paper goes as follows. 

Theorem 2.1. Under Assumptions (|2.ip - (|2.4p . if ^{po,Wo) < oo, pt = Stpo 
and Wt — RtWQ, then 

^<i>ipt,Wt) + 2X^{pt,Wt)<0 Vi>0. 
dt 

In particular the action functional decays exponentially if A is positive: 

<^{pt,Wt)<e-^^*<t>{po,wo) yt>0. (2.9) 

At formal level, this follows by an easy convexity argument. The rigorous proof 
requires many regularizations. See Theorem 16.11 for a more detailed version of this 
result. Now let us review some of the consequences of Theorem 12. II 

Entropy, entropy production and generalized Poincare inequalities. We can 

now apply Theorem 12.11 to the KFP flow. With w — Dp, we find that the entropy 
production functional decays exponentially: 

j^Pipt) + 2XP{pt)<0, P(pt) < e2^*P(po) yt>0 (2.10) 

if A is positive. By integrating (|2.8p along the KFP flow when t varies in using 
(|2.10[) and '5(1) — 0, we recover for p — po the generalized Poincare inequalities 

^{p)<^Pip) (2.11) 

found by Beckner in |13| in the case of the harmonic potential and for h{p) :— p", 
a e (0, 1), and generalized for instance in [8^. Such inequalities interpolate between 
Poincare and logarithmic Sobolev inequalities. 

If we combine (|2.1ip with (|2.10p . we find that the entropy decays according to 

^*(pt) + 2A*(pt) <0, *(pt)<e^'^**(po) Vt>0. 



By integrating from to i the inequality 

d 
di 



(tPip,)) = P{p,)+tjP{p,) < P{p,) = -|*(P*) 



which itself follows from (|2.8p and (I2.10p . we observe a first regularization effect along 
the KFP flow, namely 

tP{pt) <-^{po) Vt>0. (2.12) 

If A is positive, we can refine this estimate and actually prove by the same method 
that P(pj) < $(p„) for any t > 0. 

The /i-Wasserstein distance. If /i is a measure with absolutely continuous part p 
with respect to 7, and singular part /i^, if 1/ is a vector valued measure which is 
absolutely continuous with respect to 7 and has a modulus of continuity w, i.e. if 

p = p^ + p^ and i/ = w^, (2-13) 

we can extend the action functional $ to the measures p and v by setting 



<^>{p,v) ^<^>{p,w) = <j){p,w)d-f. 

We shall say that there is an admissible path connecting pa to pi if there is a solution 
(/is, ^'s)se[o,i] to the continuity equation 

dsPs + V • = , s e [0, 1] , 

and will denote by r(/io, pi) the set of all admissible paths. With these tools, we can 
define the h- Wasserstein distance between po and pi by 

Wl{po,pi) ■.= mi^^J^ ^Ps,iys)ds : {p,i^) e T{pa, pi)^ 

Notice that h in "ft,- Wasserstein distance" refers to the dependence of $ in h through 
the action density (j), the usual Wasserstein distance corresponding to h{p) — p. If 
ipt)t£{o,T) is a curve of measures, its h-Wasserstein velocity \pt \ is determined by 

\pt\^ ^mi{^p,u) : V-u^^dtPt}. 

Using the decomposition (|2.13p . we compute the derivative of the entropy along the 
curve {pt)te{o,T) as 

'^-*(/3t) = / '4''{pt)dtptd^ ^ [ tp"{pt) Dpt ■ wtd-y 



(2.14) 



dt 

and find that 

VriP^Dpt-^/w^wtdj<y^p{^)\pt\ 

at J^d 

by the Cauchy-Schwarz inequality. Along the KFP flow, we know that 



-^(pt) = -P{pt) = -\pt\' = -/pRiAtI , 

which is the equality case in ()2.14p . This characterizes the KFP flow as the steepest 
descent flow of the entropy 4', i.e. this is a flrst charaterization of KFP as the gradient 
flow of with respect to the /i- Wasserstein distance. 
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The KFP flow connects fi — p-f with p^ — 7 and it has been established in 
that one can estimate the length of the path by 



[pn)^ / VPUt)dt^ / \pt\dt (2.15) 
Jo Jo 



(see Section [3T5] for details). According to ()2.10p . we get 

WH{pn)<VP{p) e-^' dt = - ,/P{p) 



This establishes the entropy production - distance estimate 

1 



Whip,-/)<jVPip) : if M = P7 



Along the KFP flow, we also find that 



using (j2.1ip . By applying (|2.15p . this establishes the (Talagrand) entropy - distance 
estimate 

wl{pn)<\-^{p). 

Contraction properties and gradient flow structure. Here as in [21], we use 

the technique introduced in [26] and extended in [TT] § 2] : we consider a geodesic (or 
an approximation of a geodesic), and evaluate the derivative of the action functional 
along a family of curves obtained by evolving the geodesic with the KFP flow. 

Consider an e-geodesic {p^ .w") connecting = p°7 to p^ = p"^ 7, i.e. an admis- 
sible path in V{pQ,pi) such that $(pg,t(;g) < M^^(Po,Po) + e for any s e (0, 1) and 
observe that by (|2.7p . we know that (pf = Stp^ ,wl = RtW^) is still an admissible 
curve connecting Stp^ to Stp^ ■ Therefore (|2.9p yields 

WI{pIpI)< f\{pt,wt)ds<e-'^' (\{plwl)ds<e-^'''{Wl{plpl)+e) , 
which, by letting e — 0, proves that the KFP flow contracts the distance: 

Wh{§tf^°,St^i^) <e-^'Wh{p",p') yt>o. 

See Theorem 17. II for more details. 

Next, we should again consider an e-geodesic, but for simplicity we assume that 
there is a geodesic {p^,w^) connecting a = p^ = p° j to p = p^ = 7, i.e. such that 
^{p^^w") = W^{a,p), and consider the path 

ipt,wt) {Sstp\RstWs+tDpt) 

connecting a to pt := St/i. Notice that our notations mean that p'* = Pq. Since 

dspt ^Pt+t A^pt ^W^-{wt+t Dpi) , 
7 



the path is admissible and, as a consequence, 




We can therefore differentiate the right hand side in the above inequahty instead of the 
distance and furthermore notice that it is sufficient to do it at i = 0; see Theorem 1 7. 2 1 
and its proof for details. Along the KFP flow we find that 



This is the strongest metric formulation of a X-contracting gradient flow. Here we 
have defined the relative entropy as 5'(/i|7) := ^{p) if /i <C 7 and /i ~ pj, and 
^'(cr I 7) := +00 otherwise. Hence we recover a second characterization of the fact 
that KFP is the gradient flow of 4* with respect to Wh- 

As another consequence, the entropy ^' is geodesically X-convex. This follows 
from (|2.16p . Fix a geodesic p^ between /i" and p^ , follow the evolution of /i^ by KFP 
taking first /i*^ and then p^ fixed, and apply (j2.16p with pt := §t/x* and p = p^ ov 
p = p^. Because of the minimality of the energy along the geodesic at time t — 0,hy 
summing the two resulting inequalities we prove the convexity inequality of ^. See 
pn Theorem 3.2] for more details. 

As a final observation, let us notice that, directly from the metric formulation 
(|2.16p . it follows that the KFP flow also has the following regularizing properties: 



The first estimate can indeed be obtained by integrating (I2.16|) (with A = and 
(7 = 7) from to t and recalling that < ^'(pt) is decreasing. As for the second one, 
we observe that also t P{pt) is decreasing by (j2.10p . so that (|2.12l) and (|2.16l) yield 



A further integration in time from to t completes the proof. Notice that it is crucial 
to start from a measure p = po^ at finite distance from 7. 

3. Definition and properties of the weighted Wasserstein distance. In 

this section we first recall some definitions and results taken from [3T] . The measure 7 
and the functions and ip are as in Section [51 and we assume that Conditions (j2.ip - 
(|2.4[) are satisfied. 

3.1. Properties of the potential. Let 1/ : R'' -> R be a A-convex and continu- 
ous potential. We assume that A is nonnegative and A-convexity means that the map 
X i-> V{x) — -I |a;p is convex. When V is smooth in W^, this condition is equivalent 
to (|2.ip . We are assuming that is integrable in R'', so that we can introduce 
the finite, positive, log-concave measure 7 defined by (12. 2p . For simplicity, we shall 
assume that 7 is a probability measure, i.e. Z = 1, which can always be enforced by 
replacing V hy V + logZ. The potential V being convex, the integrability of 
is equivalent to the property that V{x) t cx) at least linearly as \x\ f 00; see e.g. [3 
Appendix]. As a consequence, there exist two constants ^ > 0, S > such that 



i jWiipua) + ^ Wiipua) < *(a 1 7) - *(Mt I 7) • 



(2.16) 



*(pt) < :^M^^'(Po,7) and Pipt) < -.W^ipo,-/) Vi>0. 




V{x)> A\x\- B VxeR'^. 



(3.1) 
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We recall that non smooth, convex potentials V can be approximated from below 
by an increasing sequence of convex potentials : 

Vnix) := ^ |xp + inf (f \x - + V{y) ~ ^ \y\A . 

2 yes.'' \ 2 2 / 

Moreover, the potentials Vn are A-convex and, even in the case A = 0, they satisfy 
conditions p.ip with respect to constants A and B which are independent ofn. In par- 
ticular, the log-concave measures 7„ := e~^"^'^ weakly* and monotonically converge 
in C°(R'')' to 7. By this regularization techniques, many results could be extended 
to the case when V is just lower semicontinuous and can take the value -l-oo. 

3.2. Convexity of the action density. As in Section [51 consider g and h on 
(0,00) such that g{p) — l/h{p) and (j){p,w) — g{p) \w\'^ — \w\'^/h{p). The following 
result has already been observed in [21] but we reproduce it here for completeness. 

Lemma 3.1 (Convexity of the action density). With the notations of Section\^ 
the action density cj) is convex if and only if h is concave on (0, 00) or, equivalently, 
if g satisfies Condition (|2.4p . 

Proof. By standard approximations, it is not restrictive to assume that g, h & 
C^(0, 00). First of all observe that 

g h =2{g) -gg , 

so that h" is nonpositive if and only \i 2{g'Y < 9 g" ■ Next we evaluate the second 
derivative of </) along the direction of the vector 2; = (x, y) e K x M'* as 

(d20(p, w) z, z) = g"{p) \w\'x^ +^g'{p)wxv + 2 g{p) jyp . 

By minimizing with respect to x £ R, we get 

g"{p) \w\^ (d2</,(p, w) z,z)>2 [g"{p) \w\^ g{p) \y\^ - 2 {g\p) w ■ y)'] (3.2) 

if g"{p) > 0, with equality for the appropriate choice of x. The convexity of (j) is thus 
equivalent to 

g"{p)\w\^gip)\y\'>2{g'{p)wy)' Vp>0, \fy,weR''. 
If (j) is convex, by choosing y := h{p) g'{p) w and using h{p) g{p) = 1, we get 

g"{p)\w\^h{p){g'{p)r\w\^>2[h{p){g'{p)f\w\^f Vp>0, V G M^ 

which yields (|2.4p . Conversely, the convexity of (j> follows from {w ■ y)^ < \w\'^ □ 

We can introduce a modulus of convexity as follows. Assume that for some a G 
(0, 1] we have 

g{p)g"{p)>{l + a-^){g'{p)r Vp>0. (3.3) 
By (|3.2[) . we obtain the refined estimate 

{B^cl){p,w)z,z)>2l3cl>{p,y) V z = {x,y) e with (3 := ^—^ . (3.4) 

1 + a 
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Such a refinement has interesting consequences, which have been investigated in [5J 
[71 [5^ . The refined convexity assumption (|3.3p is equivalent to 



/ji/" is 



concave . 



Remark 3.2 (Main example). Our main example is provided by the function 
h{p):^p\ 0<«<1, = 

P 

which satisfies p.4p . When a = we simply get 

(j){p,w) := \w\'^ , 
and for a = I we have the 1-homogeneous functional 



4>{p,w) 



P 



Notice that the above considerations can be generalized to matrix- valued functions 
g and h: see [H] Example 3.4]. 

3.3. The action functional on densities. The action functional $ induced 
by 4> has been defined by (j2.5p . with domain 

2)($) := [{p,w) e Ll^iR'') X Li(R'*;R'*) : p > , Hp,w) < oo} . 

Assuming as in Section r3.2l that cf) convex, it is well known that if {pk)keN and {wk)ken 
are such that (pkjWk) G I'(<f>) for any k G N and if pfc ^ p in Ll^{M.'^), and Wk 
w E L}^{M.'^; R"^) as n t oo, then by lower semi-continuity of $, we have 

limini ^{pk,Wk) > ^{p,w) . 

n't'oo 



Lemma 3.3 (Approximation by smooth bounded densities). Consider two func- 
tions p G L^(]R'*) and w G L!!^(R'^; R"*) such that p>0 and ^{p,w) < oo. Then there 
exist two sequences {pk)kefi ^.^d (wk)keN of bounded smooth functions (with bounded 
derivatives of arbitrary orders ) such that vai^d pk > and 

\im. pk=p inLUm.'^), YiuYWk^w m L:!,(R'^; R'^) , 



Pkd'y= / pd-y VfcGN and lim / <j){pk,Wk) d'y — / (t){p^w) d^ . 

Proof. We first truncate p and w from above as follows. Let m :— J^g^^pdj 
and, for any fc G N, rrife := Jj^dif /\ k)d^, Rk ■= {x G R'' : p{x) < k}. We set 
Pk '■— rnj^^ m{p A k) and 



Wk{x) 



w{x) if |';i'(a;)| < k and x € Rk , 
otherwise . 
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Clearly — > p, Wk — > w pointwise 7 a.e. in M'', so that Fatou's Lemma yields 



< / w) d'j < I (j){p, w) d'j = w) , 



liminf $(pfc,ti;fe) = $(p,u;) . (3.5) 

Since p A k ^ p in L}^{M.'^) as fc t 00, we have mk m and pfc — t- p in _L!!j,(R''). The 
dominated convergence theorem also yields Wk — > to in L!!|,(]R''; M''). Finally, since 
Pk ^ P and \wk\ < I If I on Rk, and since g is non increasing, 



JRk 

so that the "liminf" in p.Sp is in fact a limit. 

Next we perform a lower truncation on p. By a diagonal argument, it is sufficient 
to approximate the functions pk and Wk we have just introduced, so we can assume 
that p is essentially bounded by a constant k and we omit the dependence on k. For 
(5 > we now set ps := {p + S) m/ {m + 6). Observe that 

PS - ^ {p - m) and p - ps = — [p - m) 

m + 771 + 

so that m < ps < p on the set i?^ and, by convexity of 5, we get 

|g'(77i)| (fc — 777) 



gips) < Cs g{p) where Cs ^1 + 5- 



g{k) {5 + m) 



On the other hand, on the set Rm, we have p < ps, and then g{ps) < g{p)- As a 
consequence, 

(j){ps,w)d'y < Cs / (j){p,w)d'y. 

We can then pass to the limit as 5 J, 0, since ps p pointwise. 

The last step is to approximate the functions p and w, with S < p < k, \w\ < k, 
by smooth functions. We consider a family of smooth approximations p^ and 
obtained by convolution with a smooth kernel. We finally set rrie := /^d Pe d"/ and, in 
this framework, redefine p^ := mp^/m^. Since {pe,Wg) converges to {p,w) pointwise 
a.e. in R'^ and is uniformly bounded, we can pass to the limit as above when e ^ 0. □ 

3.4. The action functional on measures. Since we assumed that h is con- 
cave and strictly positive for p > 0, /7 is an increasing map, so that g is decreasing. 
We extend h and g to [0, 00) by continuity and we still denote by (p the lower semi- 
continuous envelope of (f) in the closure [0, 00) x W^. If h{0) > then 17(0) < 00 and 
<j){0,w) = g{0) |if p. When h{0) = we have g{0) = 00 and 



00 if It) 7^ , 
if to = . 



We also introduce the recession functional 



' (p, w) := sup Y 0(A p, A ti;) = lim ^ '/'(A p,Xw) , 
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which is still a convex and lower semicontinuous function with values in [0,oo], and 
1-homogeneous. It is determined by the behaviour of h{p) as oo. If we set 

/.-:=lin.M=:^, 
ptoo p g°° 

we have 

itw = 



and 



h'-p-^ P ' ^^^^ h°°>0 

OO if /9 = and w ^0 



Let /X e M+(IR°) be a nonnegative Radon measure and let f e M(M'';R'') be a 
vector Radon measure on R*^. We write their Lebesgue decomposition with respect 
to the reference measure 7 as 

/It := P7 + /i"*- , u :=wj + u-^. 

We can always introduce a nonnegative Radon measure a S M"'"(K'') such that p.^ = 
p-^a <C (T, I/*-"- = w-^a <^ a, e.g. a := fj,-^ + \v-^\ and define the action functional 

^p,u\j):= f cl){p,w)dj+ f (l)'^{p^,w^)da . 

Since is 1-homogeneous, this definition is independent of a. As we have done up 
to now, wc shall simply write v) = ^{p,v\ 7) when there is no ambiguity on the 
reference measure 7. 

Remark 3.4. Ifh has a sublinear growth, then h°° = and, as a consequence, 
if ^{p,v) < +00, then we have 



= to • 7 <C 7 and ^{p,i>) = / (j){p,w) 

Js.<i 



d'y 



so ^{p,v) is independent of the singular part p,-^. When h has a linear growth, 
i.e. h°° > 0, if^{p,u) < +00, then we have 

=w^ ■ p^ . 

In both cases, one can choose a = p^, so that 

1/ = li; • 7 ^ 7 + ■ p^ 

and if g°° = 1/ h°° is finite, then we have 

^p,u\j)= f cj){p,w)d^ + g^ [ \w^\^dp^, 

while the last term simply drops if h°° = 0. 
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Lemma 3.5 (Lower semicontinuity, regular approximation of the action func- 
tional). The action functional is lower semicontinuous with respect to the weak con- 
vergence of measures, i.e. if (7n), (/in) cind (i^n) o.fG sequences such that j„ 7 
weakly in M+(M''), ij,„ ^ weakly in M+(R'') and v„^*v in M(M'';IR'') as n f 00, 
then 

liminf $(/i„, | 7„) > $(/z, «^ | 7) . 

nfoo 

Moreover, for every fi £ M+(R'') and v £ M(R'';R'') such that < 00, there 

exist sequences (/i„) and (vn) for which 

fJ-n := Pn 7 with pn £ C^(M'*) and inf /?„ > , iv„ := w,-, 7 with Wn £ (M'^; R'^) 

such that 

^ p and Un ^ u , lim / 0(p„, u;„) ^7 = $(/i, | 7) . (3.6) 

"too Jjjd 

Proof. The first statement is a well known fact about lower semicontinuity of 
convex integrals (see e.g. [T]). Concerning the approximation property (13. 6p . general 
relaxation results provide a family of approximations in L}^{W^). We can then apply 
Lemma 13.31 and a standard diagonal argument. □ 

3.5. The weighted Wasserstein distance. Denote by TOW^) the collection of 
all Borel subsets of R'^, by M+(R'') the collection of all finite positive Borel measures 
defined on R'^ and by ^{W^) the convex subset of all probability measures i.e. all 
/i £ M+(R'*) such that y.{W^) = 1. If M(R'';R'^) is the set of the vector valued Borel 
measures u : 23 (R**) — > R"* with finite variation, i.e. such that 

\u\{B) :=sup{^|i.(i3,)| : 

B ~ \^ Bj , Bj £ 'B(R'^) pairwise disjoint , n < ooj < 00 

j<n 

for any B £ !B(R'*), then is in fact a finite positive measure in M"'"(R'^) and u 
admits the polar decomposition 1/ — w\i/\ where the Borel vector field w belongs to 
L|^j^|(R'*;R'*). We can also consider 1/ as a vector (1/^,1/^, • • • ,1''^) of d measures in 

M(R'^;R). 

For any T > 0, let C£(0, T; R'') be the set of time dependent measures (/it)te[o.T]j 
(i^t)te(o,T) such that 

1. t H> /it is weakly * continuous in M[J^^(R'^), 

2. (i>'t)tG(o,T) is a Borel family with jiVf |(_B_r) dt < od for any i? > 0, 

3. (/i, f ) is a distributional solution of 

dtfit + V • ivj = in R'^ X (0, T) . 

As in f^, we define the weighted Wasserstein distance as follows. 
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Definition 3.6. The {h,^)-Wasserstein distance between /xq and /ii G M[+^(M'') 
is defined by 

rr 1 1^/^ 

Wh^■yi^J.a,^J.l) :=inf||^/„ ^{jjt^vtll) dt^ : 

e e£(0,l;M'*), ^t=o = /io, Mt=i=Mi} (3.7) 

wii/i I' I 7) := i") + ^°°(w^) if /i = pj + p^ and u ^ iw^ + ji^ , 
1/ 1 7) := cx) otherwise, and liniA-foo A (/)(A, ii;). 

We denote by JAh.^lc^] the set of all measures /i G M[]^^(R'^) which are at finite 
Wh, J -distance from a. 

Notice that in i2T] we were using the notation W^^^ instead of Wh^-y- Whenever 
there is no ambiguity on the choice of the measure 7, we shaU simply write Wh- The 
next resuh is taken from [211 Theorem 5.6 and Proposition 5.14] 

Theorem 3.7 (Lower semicontinuity) . If (j) satisfies (|2.4p and (|2.5I) . the map 
(/iOiMi) ^ W^/i,7(M0i is lower semicontinuous with respect to the weak * conver- 
gence in MjJjj,(]R''). More generally, suppose that in 'M^^iM.'^), /i" is mono- 
tonically decreasing w.r.t. n and pointwise converging to h, and iIq^* ijlq, /x"^*/Lti in 
M+^(M'') as n t oo- Then 

hminf iy/,,>,^,>(/io,y^") > Wh^^ijio, pi) . 

n'f'oo 

// moreover 7„ > 7 we have 

n— >+CxD 

It is possible to reparametrize the path connecting /ig to fii in the definition of Wh,j 
and establish that, for any T > 0, 

1 /2 

Wh,^{a,rj) ■.= mi{VT[j^ ^fit,ut\l)dt'\ : G eE{0,T; a ^ r])} 

where eE{0,T;a 77) denotes the set of the paths (/x, z^) G ee(0,T;M'') such that 
/xt=o = f and iit=T — V- By ^21i Theorem 5.4 and Corollary 5.18], we have the 

Theorem 3.8 (Existence of geodesies). Whenever the infimum in p. 71) has a 
finite value, it is attained by a curve {p^u) G e£0(0, 1;R'^) such that 

'^{^iuut\-1)^wl^{^i^,^il) vt g (0,1) if^ a.e. 

In this case we have the equivalent characterization 

M^^,^(a,77)=min{/J[<i>(^i,«^i|7)]'/'dt : (p,v) ^ QE{Q,T-<j ^ t^)]. 

The curve {^^t)te[o,i] associated to a minimum for p.7p is a constant speed mimimal 
geodesic: 

Wh,'y{fJ.s,fJ't) = |i - s| Wh,f{^io, ti-i) V s , ^ G [0, 1] . 



We may notice that the characterization of Wh^jio', rf) in terms of Jj^ ^ ^{f^t, \ 7) dt 
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allows to consider the case T — +00. By ^ Chap. 1] (also see ^21t p. 222]), one 
knows that 



Wh.j{fJ.o,fJ-T) < / \fJ-t\dt with l^tl := lim 
Jo 



for any absolutely continuous curve 1 1-^ such that /it=o = Mo Sind ^t=T = Mt- 

Now let us come back to the formal point of view of Section [2] and establish 
(|2.15p in this framework. Assume that pt is given by KFP and Wt = Dpt- The curve 
t^t ^ Ptl connects Po = Pol with p^ = 7 and, using 



= ^'^{pt,wt\i) = \p't\ 

it follows that 



Wh,^{pon)< / ^P{pt) dt = / 

Jo Jo 

as already noted in Section [5] (equality case in (I2.14p '). On the other hand, for any 
(^, iv) e CE{0,T;pq Pt), T e (0, 00), we have \pt\ < yj<^{pt,i>t) and so 

/ \fit\dt< [ ^^{pt,ut)dt . 
Jo Jo 

By taking first the infimum (p, i>) € Ci?(0, T; pq — >■ px) and then the limit T — > 00, 
we also find 

POO 

/ \P't\dt<Wh,^{pon) , 
Jo 

thus proving the equality in the above inequality. This completes the proof of ()2.15p . 

4. Entropy and entropy production. Let us consider now a function ^jj such 
that '4'"{x) — 5(2;) for any a; > 0. Among all possible choices of ip, we consider 
in particular the convex functions ipa : [0, 00) — > [0, 00) depending on a > and 
characterized by the conditions 

C(a;)=5(a;), fpa{a) = ip'aia) = , i.e. iPaix) = {x - r) g{r) dr . 



Observe that V'a G C^(0, 00) has a strict minimum at a > and it satisfies the 
transformation rule 

4'a{x) = ip{x) — V'(o) — (x — a) V a > , 

independently of the choice of ip (for a given function g). When g{x) — 1/x we obtain 
the logarithmic entropy density E(x) := xlogx and the family 

ry I 

Ea(x) := / (y — r) - dr = a; log a; — a log a — (1 + log a) (a; — a) , 
Ja r 

which provides useful lower/upper bounds for ip. In fact, h being concave, if h{0) = 0, 
then h{x) > h{a) x if < a; < a, so that 

g{x)<^^ and iiix) < g{a)£a{x) Vxe(0,a]. 
x 

15 



On the other hand, when x > a, we have h(x) < h{a) x, so that 

gix) > and V(a;) > ff(a) ^a{x) V a; € [a, +00) , (4.1) 

X 

thus showing that ipix) has a superhnear growth as x t 00. 
We can therefore introduce the relative entropy functional 

^'(p) := / tlJa{p{x))dj{x) ^ / (■ilj{p{x))-^{a)]dj with a = / pd-f . 

In the particular case ■(/) = £, we set 

■^(p) / pfog/3(i7 — afoga with o, = pdj ■ 

Since V' is convex and superhnearly increasing, if sup„ < 00, then there exists 

a subsequence weakly converging to p in L^(R'') and 

liminf *(p„) > . 

Remark 4.1. // i/ie function ip satisfies ip" — g, ip{0) = and if (12. 4p /loMs, 
i/ien V' ttlso satisfies McCann's conditions, i.e. the map x ^-^ ip{e~^) is convex and 
non increasing on (0, 00) or, equivalently, 

xip'-ip>0 and x'^ ^" - x->p' + ^ > V a; > . 

The convexity of ip indeed yields xip'{x) — ipix) > — ip{0) ~ 0. Consider the 
function 'd{x) := x"^ ijj" {x) — xil>'(x) + 'ip{x) and observe that \imxio'd{x) = 0, since 
ip" = 1/h and h is concave so that, in particular, h{x) > cx near x = 0, for some 
positive constant c. On the other hand, we have 



■d'{x) — x^ g'{x) + X g{x) — X ■ 



X 



dx \h{x) 

and the function x 1— >■ h{x)/x being positive, non increasing, we deduce that d'{x) > 0, 
so that ■d>0. 

Let us introduce the Sobolev spaces 

W^'PiR") [p e W,];^^{R^) : {\p\P + \Dp\p) dj < ^} . 
For p € W^'^(M''), p > 0, we define the entropy production functional as 

P{p) ~ Dp) with domain 1){P) |p G W^^\R'^) : p > , P{p) < ooj . 
We also introduce the absolutely continuous functions 



and observe that 







^L^(r) = ri/)"(r) = r.g(r) = ^ 
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is bounded if and only if h(r) has a linear growth as r f cx). In the case h(r) = r, 
t/i = E, to the entropy functional IK corresponds the entropy production functional 



Proposition 4.2. Let p he nonnegative function in Ll^{W'-). Then p e Wy^(K'^) 
and P{p) < oo if and only if Df{p) S L'^{W^]W^) and in this case we have 

P[p)= I \Df{p)\'dj. 

If p e T){P) and h{r) >hr for some constant h> 0, then L^{p) € Wy^(M'^), 

f L^MpIP < h-ip(p) and P(p) < h-ij(p) . (4.2) 

Moreover, the functional p i— > P{p) is lower semicontinuous with respect to the weak 
convergence in Ll^{M.'^), i.e. if a sequence {pn)n£N weakly converges to some p in 
L^iR"^) and sup„gNP(p„) < oo, then p G W^^^iR"^) and 

liniinfP(p„) > P(p) . (4.3) 

nfoo 



Proof. Identity (|4.3p and D L^{p) — pg{p) Dp are straightforward if p takes its 
values in a compact interval of (0, oo). The general case follows as in Lemma [3.31 bv 
a standard truncation argument, while the lower semicontinuity is a consequence of 
convexity. □ 

5. The KFP flow and its first variation. 

5.1. Variational solutions to the KFP flow. As in Section[21 let us introduce 
the differential operators 

Vy-v.^e^V ■ (e-^v) ^ V ■ v - v ■ DV , 
A^p := V^, • {Dp) = Ap- Dp- DV , 

which, with respect to the measure 7, satisfy the following "integration by parts 
formulae" against test functions C € C^(R'^): 

/ v-DQd-y^-i V~fvCd-f and / Dv-DCd-f^- A^vCdj. 

We consider the Kolmogorov-Fokker-Planck equation 

dtpt - A^Pt = in (0, 00) x M'' . (5.1) 

For simplicity, we will consider equations in the whole M.'^ (corresponding to the finite- 
ness assumption on the potential V) ; necessary adaptations when this is not the case 
are straightforward and left to the reader. We will also assume that 

the potential V is smooth with bounded second derivatives . (5-2) 
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Based on the integration by parts formula, the variational formulation of (|5.ip in the 
Hilbert space L^(M'*) relies on the symmetric, closed Dirichlet form 



where Wy^(R'^) is endowed with its natural norm ||p||^i,2^jjij := ||p||^2(Rd)+ a^ip, p)- 
Using smooth approximations, it is not difficult to prove that W^'^(R'') is dense in 
Ll^iW^). The abstract theory of variational evolution equation and the log-concavity 
of the measure 7 yield the following result (see e.g. 0] Thm. 6.7]). 

Proposition 5.1. Assume that (I2H)-(IH1) hold. For every po G ^^(R'^), the 
solution of (j5.ip has the following properties: 

1. There exists a unique pt — Stpa € ^\oc (O; 00; L^(]R'')) , t > 0, such that 

^(pt,?7)i2(R.)+a^(pt,?7)=0 yr]eW^'^{R''), limp^ = po (R^) . 

(5.3) 

If Pmin < Po < Pmax; then pt Satisfies the same uniform bounds. The semi- 
group {St)t>o is an analytic Markov semigroup in L^(R'^) which can be ex- 
tended by continuity to a contraction semigroup in L?^(R'^) for every p G [1, 00) 
and to a weakly * continuous semigroup in L^(R'^). 

2. For every p, a G L^(R''), we have 



{Stp)adj^ / p{St<j)dj yt>o. 

3. For every t>0, St maps L^(R^) into Cb(R'') and Lipf,(R'^) into itself with 
the uniform bound 

[StplupiR") < [plupiR") V < > , V p e Lipb(M'^) . 

4- If Pa ^ Oi j^d Pod-l < 00 and 'K{po) < 00, then the map t H> ^{pt) is 
convex, pt G W^'^{M.'^) for every time t > 0, and 

sup / |a;|Vtd7<c3o, J{(pO = -J(Pt) , ^^U^' HPt)) < ■ 
tG[o,T]JK<i at dt\ / 

Notice that the Assumption po G L^(R'^) is not needed in Property 4, according 
to H Thm. 6.7]. 

5.2. Measure valued solutions to the FP flow. Wc first recall some basic 
results on measure-valued solutions of the Fokkcr-Planck (FP) equation 

dtPt = Apt + y-iDVpt) (t, x) e (0, +CX)) X R'^ . (5.4) 

Solutions of (|5.4p are understood in the sense of distributions, i.e. for any T > and 

if G C^i[0, T] X R'^), we have 

/ ipTdpT= / •^dpa+ I I [dtift + ^ft - DV ■ Dipt j dpt dt . (5.5) 

18 



For any ^ E M+(M''), we denote by mp(/i), p e [l,oo), the p-moment of fi, i.e. 
mp(/i) :— Jjgj \x\P d^{x). By 'J'2{^'^) we denote the space of probabihty measures 
on M'' with finite second moment m2. The relative entropy of p with respect to 7 is 
defined as 



J{(^|7) 



/ p log p (^7 if /i ^ 7 and iJ = pj , ^{p | 7) := +00 otherwise . 



Given two probability measures p and v in J'(R'*), the classical Wasserstein distance 
is defined as W^2(m,!^) inf{[/jj,^jj, |y - dI]]i/2 . j:eT{p,u)}. lieieT{p,u) 

is the set of all couplings between p and v. it consists of all probability measures S on 
^ j^d -^yj^Qgg fjj-g^ a,nd second marginals are respectively p and v, i.e. x M^) = 

p{B) and x = v{B) for any B e 'B{W^). Notice that the notation W2 is not 

consistent with the one for weighted distances Wh', we shall however use it as it is 

classical. 

For a proof of the next results see e.g. O Sect. 3]. 

Proposition 5.2 (Uniqueness and stability of the solutions of FP). Let po G 
T2(K''). 

1. The FP equation (|5.5p has a unique solution pt — Stpo in the class of weakly 
continuous maps 1 1-^ pt E M+(R'') with sup^^j-Q q^-^ m2(/it) < +00. 

2. The unique solution pt is continuous with respect to the Wasserstein distance 
W2 and Lipschitz continuous in all compact intervals [to,ii] C (0, +00). 

3. It is characterized by the family of variational inequalities 

i ^Wiipu + ^ Wiipu v) + nv^t I 7) < ^(^ 1 7) V e T2(K') . 

^. In addition, it is stable: p^ — > pQ in J'2(II^'*) implies that /ij* -> Pt m 0^2 (K'^) 
for all t > 0. 

Notice that the measure 7 provides a stationary solution of (|5.4p . All solutions 
Pt weakly converge to 7 as t +00. Finally, pt is absolutely continuous with respect 
to 7 for any t > 0, with density pt, and pt is a solution of the KFP flow. 

5.3. Variational solutions to the modified KFP equation. We consider the 
flrst variation of the KFP flow, i.e. the modified Kolmogorov-Fokker-Planck equation 

dtWt - A^wt + D'^V Wt = in (0,00) X M'' , \imwt ^ Wq in L^(K'^; K'^) (5.6) 

for the vector field w : (0,oo) x R'^ ^ M.'^. In the Hilbert space W := Wjf'^iM.'^-M'^), 
we consider the continuous (recall (|5.2p ') bilinear form 

ay{v,w) :— / (Dv:Dw + D^Vv-w]dj. 

We look for solutions w E Wi^'^^((0, 00); (Rd; Rd)) n L\^Jy\^,oo);W) solving the 
variational formulation 

d f 

- WfCdl + a^{wtX)^^ VCeW^. (5.7) 



dt 

Observe that vector fields in CliW^-.W^) belong to W. Actually the space of smooth 
compactly supported functions C^(R'^;R'') is dense in W , and W itself is dense in 
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L^(M'';M''). Notice moreover that if C : M [0, oo) is a smooth convex function 
with bounded second order derivatives and ^(0) = 0, and z{w) := ^-j^p-w (with 
z{0) — 0), an easy calculation shows that solutions of (|5.7p satisfy 

d f 

~17 C{\wt\)d^ ^ ayiwt,z{wt)) >0 a.e. in (0, oo). 

at jjjd 

With these observations in hand, we can apply the variational theory of evolution 
equations and a simple regularization argument to prove the next result. 

Proposition 5.3. For every wo & L^(R'^;R''), there exists a unique solution 
w = Rwo of ^ mW^ji^f((0,c»);i2(Rd;Rd))nLj2^^([0,oo);Ty) withlimt^oWt = Wq 
in _L^(M''; M''). The semigroup R is symmetric 

/ RtW-zd-/= w-Rtzd-f y w , z e L'^{R'^;R'^) , Vt>0, 
and satisfies 

I C(|i2t«^ol)d7< / C(kol) for every L^^iR^-X) 

and every convex function C : M — [0, oo) with C(0) = 0. In particular R can be 
extended by density to a contraction semigroup in LP(R'^;R'^), p € [l,oo]. 

The link between (|5.1I) and (|5.6I) is enlightened by the next result. 

Theorem 5.4. If pt is a variational solution of the KFP equation (15. ip with 
initial datum po € Wy^{R'^), then Wt Dpt belongs to C°{[0,oo); L'^{R'^)) and it 
is the solution of the modified KFP equation (|5.6p with initial datum Wq := DpQ. In 
particular we have 

[ DStp-wdj= [ Dp-Rtwdj y peW^'^iR"^) , Vii; e i^^Rd.j^d-) ^ 

jR<i jR'i 

The same result holds if po belongs to Wy^{R'^) . 

Proof. Since T>{A^) is dense in W^'^{R'^), we can assume that po G D{Ay). 
Then the regularity result of Proposition 15. II shows that pt S D(A^) for every t > 0. 
Setting Wt := Dpt, we know (see e.g. the argument in the proof of [22] Lemma 5.2]) 
that a-y{wt,Wt) < ||A^(Oo||L2(Rd) < +oo. For a fixed C G C^{R'^;R'^), we can then 
evaluate 

/ WfCdj^^f DpfCd-i^~^[ V, . Cd7 = / Dpf r>(V, • C) ^7 ■ 

JR'* JKJ^ at J]g,d Jjjd 

(5.8) 

With the notations di — d/dxi and dij = /dxidxj for i, j — 1, 2. . . d, let us observe 
that 

i i 

and 

Dpt ■ DiV, • C) = E ^iP^ ^3^^ - ^oPt djQ d^V - d,pt Q df^V . 
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d_ 
'di 



Inserting this expression in (|5.8p and integrating by parts the first term we get 
/ L>Pfi>(V^-C)d7 

= - / (Diot : DC + D^wt • C) d7 = - a7('^t' • 



Combined with ()5.8p . this shows that Wt '■— Dpt satisfies the variational formulation 
of (|5.6p . The case of po G W^'^(M'^) foUows by a standard approximation procedure, 
the fact that DStPo — RtDpo, and the Li^-contraction property of R. □ 

5.4. Measure valued solutions to the modified KFP equation. Exactly 
like the (K)FP equation, the modified system can be extended to vector-valued mea- 
sures initial data. To Wt, we associate the vector valued measures Ut ■= Wtj € 
!"^-R'^) which satisfy the system 



dti^t = Ai^t + V • {DV ® ut) - D^V ut , 

in the weak sense, i.e. 

4/ C-dut^ I ( AC - DC(^ DV - D^V C) ■ dut y C ^ C^d^'') ■ (5-9) 

The semigroup can be extended to initial data which are vector valued measures with 
finite total variation using equi-integrability and moment estimates taken from 121) . 

Proposition 5.5 (Equi-integrability and moment estimates). Let ( be a non- 
negative Borel function such that /i(C^) = /rii dp, and 7(C^) = /rii d'^ are finite. 
//$(/i, I/) < oo, we have 



In particular, for every Borel set A S ®(R'^) we have 

(|i.|(A)) ' < <i>(p, v) 7(A) h{p{A)h{A)) (5.10) 
which in particular yields (7(R'') — 1) 

i/|(M''))^ < $(/i,iy)/i(p(M'*)) . 
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If moreover m2(yLt) < oo, we can bound the first moment of \v\ by 

v<m{\u\)= [ \x\d\u\<U{^l.u)^a2{-1)h{m2{^l)/rn2{l)))''^ ■ (5.11) 

Theorem 5.6. For every Vq e M(R'*;R'*) with mi{\uo\) < +00, there exists a 
unique solution Ut = ^^ti'o ^'^ the class of weakly continuous maps 

with supjgjgjij mi(|ft|) < +00, for every final time T > 0. When Uq — Wq"/ then 
^Iti^o = RtWo"f. The map Uq h- > OIuq is stable in the following sense: if 

Vo^*vo weakly* in M(R'';M'^) with sup mi(|i/^'|) < +00 

n 

then 3l(i/[f^*3ltZ^o m M(R'*;R'*). 

Proof. We divide the proof in three steps. 
Step 1. Let us first associate to v = E M(R'';R'^) the probabihty measure 

V := ^^^=\u\ = ^ J"^' 7ey(R'^), (5.12) 

where the constant M is a renormahzation factor such that u(R'') = 1. Observe that 
if mi(|i/|) = Jjjd |x| |ii'(a;)| ^7 is finite, then v e J'2(R'*) and 

m2(t^)<^mi(|i.|). (5.13) 

We also choose the action density to be (/'2(p, w) := \w\'^/ p corresponding to h{p) = p, 
and observe that the corresponding functional writes 

<^2{v,w) = M j y/T+\^\w{x)\d-i<M(\u\{W^) + mi[\u\)]. (5.14) 

Proposition 5.7. Let us suppose that w e L^(R'';R'') with mi(|iv|) < +00 and 
set u :— w^, V as in (j5.12p . Wt = RtW, ft = Wt^, vt — Stv. Then 

|i/t|(R^) < li/KR'^) , midz^tl) < |«^|(R'*) + 2mi(|«^|)+4Mm2(7), (5.15) 

and for any t > 0, we have 

(\ut\{A)y <Al(\u\{W') + m,{\u\)')vt{A) WAe-BiR"). (5.16) 

Proof. The first inequality of (j5.15p follows by the Li^-contraction property of R. 
Since the FP flow contracts the Wasserstein distance by Proposition 15.21 and since 7 
is a stationary solution, the triangle inequality for the Wasserstein distance and the 
fact that ^ rn2(^) ~ VF2(/x,(5o) yield 



\/m^(vtj < W2(wt,7) + \/m2(7) < W2(u,7) + \/rr^^(j) < V^tt2M + 2 V^It) 
On the other hand, (|5.11|) yields 

mi(|i^t|) < \/m2{vt) \/^2{vt,t^t) < ( Vm2(w) + 2^/m^(J)) ^/^^(v^ . 
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Here we used the fact that ^2{''Jt,'^t) < ^2{v,^')- This will appear later as a con- 
sequence of Theorem 16.11 and is independent of the present result. Combined with 
(|5.13p and (|5.14l) . this proves the estimate on mi(|ft|). 

Applying ((5?T0| and ((5TT4)) . we get ((5T6)) . □ 
Step 2: existence. Let us approximate a given Vq S M(]R'*; W^) with rriidivol) < +00 
by a sequence v'' = w''j^*Uo as A: ^ 00 in M(M'';K'') with Wk € L^^j^rf.j^rf) 
and midiv^jj) — > mi(li'ol)- We set :— i«t 7 with = RtW^ so that solves 
(|5.9p . Thanks to Proposition 15.71 we know that the first order moment of are 
uniformly bounded. This is sufficient to pass to the limit (up to extraction of a 
suitable subsequence) in (|5.9p and to find a solution Vt which is weakly* continuous 
in M(R'^; W^) and satisfies the initial condition in the sense that iyt-^*h'o in M(R''; R'') 
as t 4, 0. 

Step 3: uniqueness and stability. It follows by a standard duality argument, like 
in the case of Equation (|5.6p . If and are two weakly continuous solutions of 
(|5.9p . their difference (Xt := v\ — solves 

I CT d(TT= I I (dtCt + ACt - DCt ®DV- D^VCt) ' dat dt (5.17) 

for every T > and C e C;?°([0, T] x M^';M^'). By a mollification technique, it is not 
difficult to check that ((gTTl) also holds for every function C e C([0,T] x W^;W^) with 
9((/9, DC, and D^C, continuous and bounded in [0,r] x W^. 

Next, we introduce a family of smooth convex potentials Vn with bounded deriva- 
tives of arbitrary orders, which satisfies a uniform Lipschitz condition 

\DVn{x) - DV^{y)\ < L\x - y\ V x, y e R^ 

for some positive constant L which is independent of n and such that 

V,,^V , DVa DV , D'^Vn^ D'^V pointwise as n — > 00 . 

For a given 77 e C^(R''; R''), we consider the solution C,t of the time reversed (adjoint) 
parabolic equation 

dtCt + ^Ct- DCfDV - D^V DCt =i) in(0,r)xR'', Ct = ^ • 

Using a maximum principle that can be found in [23 and the fact that the first and 
second order spatial derivatives of C, solve an analogous equation, standard parabolic 
regularity theory shows that C is sufficiently regular to be used as a test function in 
(|5.17p and satisfies the uniform bound (observe that the second and third derivatives 
of Vn are still uniformly bounded) 

sup|CJ + |r>C„l<C<+^. 

t.x 



This leads to 



<C / {\DV -DVn\ + \D''V -D''Vn\]d\cTt\dt 



T] ■ dcTj 

Since the first order moment of \at\ is uniformly bounded, we can pass to the limit 
as n — )■ 00 obtaining J^^ r) ■ dax — 0. As rj is arbitrary, we conclude that u^p = u^. 
The stability is then a simple consequence of uniqueness. □ 
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6. Action decay along the KFP flow and consequences. We can prove now 
our main estimate, which is a refined version of Theorem 12.11 under the assumption 



Notice that (16.11) is equivalent to p.3p with j3 :— (1 ~ a)/(l + a). 

Theorem 6.1. Assume that (|2T|) and ([23| hold, and let {p,w) e LI^{W\R+) x 
Xi(M'^,R'') be such that '^{p,w) < oo. // dSJ]) is satisfied, then 



Proof. We first prove the result with the additional assumptions that < Pmin ^ 
p < Pmax and \w\ < lUinax 7 a.c. in M''. Assume that h is of class C^(0, cx)). It 
follows that pt — Stp and Wt = RfW satisfy the same bounds and, for all t > 0, pt-, 
dtpt € W}^''^(W^) and Wt, dtWt € W]f''^{M.'^;M.'^). The function is of class in the 
strip 



and its differential D(j){p, w) can be decomposed as 

Dp<Pip, w) ^ g' (p) \w\^ , D^<j){p, w) = 2 g{p) w . 

Since g{p) and g'{p) are bounded, the differential is also in L'^{W^\W^^^). As a 
consequence, the time derivative of i i— <^{pt,Wt) exists and 



In order to apply (|5.3p and (15. 7p we have to verify that all components of D(j){pt,Wt) 
are in W^''^{M.'^). We have already seen that they are in Let us compute their 

x-derivative: 

D {Dp^{pt,wt)) ^ g"{pt)\wt\'' Dpt + 2g\pt)wt ■ Dwt , 
D{D^^(|){pt,Wt)) = 2g'[pt)w\Dpt + 2g{pt)Dw\ for any i = 1 , 2,...d. 

The above functions are in L'iiW'-), since g(pt), g'{pt), g"{pt) and toj are bounded, so 



that 



The function h is concave and, for some /3 G [0, 1), 
{1 — j3) h h" + 2 l3 {h'Y < holds in the sense of distributions. 



(6.1) 




Q ■■= [Pmin,Pmax] X {li; G R'' : \w\ < Wmax} 




we get 





Recalling p.4p and the convexity assumption on V, we find 



-^Puwt) <-2pY,^Pt,d,wt)-2X^pt,wt) . 
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It follows from Gronwall's lemma that for all s G (0,i), 



The result follows by passing to the limit as s J, and recalling that (j) is continuous 
and bounded on Q and ps, Ws converge to p, ii; as s | in L^(R'') and L^(R'^;R'') 
respectively. The general result for an arbitrary concave function h easily follows by 
approximating ft, by a decreasing family of smooth concave functions in the interval 
[pmin,Pmax]- Finally, the general case p G Ll^iW^), w G Lj^(R''; R''), without upper 
and lower bounds, follows by approximation, using Lemmas 13.31 and 13.51 □ 

We can extend the results of Theorem 16. II to measure valued initial data. 

Corollary 6.2. Assume that (|2?Tt- (|2^ hold. Let p e J'2(R'^) and u G 
M(R'^;R'^) with mi(|i>'|) < +00. Then for every t > we have pt = §tp = ptl, 
vt = 3lti^ = wt-i with Pt G W^|i(R''), wt G H^|i(R'*;R'*) if P > 0, and 

<i>{pt,wt)+2(3y2 HPs,d.,Ws)e^^(-'-'Us<e-^^'<S>{p,i^\j) yt>0. 

Proof. This follows directly from the measure formulation of the KFP flow (Propo- 
sition [5]2] and Theorem 15.61) . □ 

Let us now consider the entropy functional ^{p) := /jj^ ip{p) dj, for a function ip 
as in Sectional 

Theorem 6.3. Let p G with J^a pdj < 00 and let pt := Stp. Then 

^'(pt) < 00 and P{pt) < 00 for every t > 0, and we have 

^^^fipt) = -P{pt) and j^P{pt) + 2\P{pt)<0. 

As a consequence, we have 

^(Pt)<e-2^**(p), tP{pt)<{l + 2Xt)e-'^'^{p) , P{pt)<e~^^'P(p) 

for any t > and the following entropy - entropy production inequality, or general- 
ized Poincare inequality, holds 

'^{p) < 7^ P{p) , ypeVi'i') such that [ \x\'^ p d'^ < 00 . 

Proof. It is not restrictive to assume that J^a pd'^ — 1. We first prove Theorem l6.3l 
for a function h which grows at least linearly at 00, and therefore satisfies h{r) > hr 
for some constant h > 0. The general result follows by writing h as the limit of a 
decreasing sequence of such concave functions ft,„, observing that the corresponding 
actions (/>„ and entropies ipn converge increasingly to (f> and ip respectively. 

By (|4.ip we know that ^{p) is finite and therefore we have 

/ J{pt) dt < < 00 and / ^^^'I'^P*^^^ d7 < 00 , 

Jo Jo Pt 
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where the second estimate follows from 

Applying the chain rule for convex functionals in Wasserstein spaces (see for 
instance [21 P- 233], we obtain that the map t ^'(pt) is absolutely continuous and 

^,T,/ A / DL^{pt) Dpt 

-TT^lPt) = / Ptd-i^P{pt). 

at J^d Pt Pt 

By combining Theorems 15.41 and 16.11 applied with Wt :~ Dpt and differentiating 
with respect to i, we get that —-^Pipt) > '^^P{pt)- AH other estimate are easy 
consequences that have already been established in Section [TJ □ 

7. Contraction of the /i- Wasserstein distance and KFP as a gradient 

flow. Consider the space 3^h,'y{W^) of probability measures at finite Wh.-^ distance 
from 7. From (|3.ip . we know that 7 has finite quadratic moments and, as a con- 
sequence of [20] Theorem 5.9], any measure in 3^h,i{^'^) also has finite quadratic 
moments. The same result holds for moments of higher order. 

Theorem 7.1. For every <7,r/ e ^/i_^(R''), we have 

Wh^^iSta , §tv) < e-^' Wh,^ia, 77) V t > . 



Proof. It is a straightforward consequence of CoroUarv 16.21 and Theorem 13.81 □ 
Theorem 7.2. For every p G 

^hni^'^), we have 

~Wl^{§tP,'j) + ^Wl^{§tt^,a) + ^{§tp\l)<^{a\j) VaeDW. (7.1) 

Proof. Let us first notice that since ^h,-)(^'^) is stable under the action of the 
semigroup (§t), it is sufficient to prove (|7.1I) only at t = 0, under the assumption 
that p writes as BtP-, for some t > 0. We make the additional assumption on the 
function h that there exists some h > for which 

/i(r)>hr Vr>0. (7.2) 

This assumption will be removed later in the proof. Let e > fixed and {p^,w^) G 
L^{R'^) X L-'^(M'', M''), s e [0, 1], be an admissible curve connecting a to p such that 

Wl^{p,a) < <^^{p\w') < Wl^{p,a)+e, 

where <S',f,{p^, w^) := ^{p^, w^) ds. For any k > 0, we take pf. ~ p'^ + k > k. Since 
h is non decreasing, we still have 

Mp:,wn<Wl^{p,a)+s. (7.3) 

Notice that, thanks to [51] Theorem 5.17] (also see Theorem 13. 8p . it is possible to 
assume that 

£M,'>^n = ^{pi,^n (7.4) 
is constant with respect to s € [0, 1]. For t > 0, we set 

w'/ = RstW -tDp'/ . 
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It is clear that (p*'*, if*'*) connects a + K7 to §t{fi + Kj) — Stfi + K-f. Note that, thanks 
to the maximum principle, we have p^'* > k. We claim that it is admissible. Indeed, 

since {p%,w'') is admissible. Hence, 

dsPf;' = ^ ■ {-Rstw^ + tDpf) = -V^ • {wf:') . 
It follows from the definition of ^ that 

Wl^iStP + Kj,a + Kj) < S^{pf,w''') , 
hence, with (|7.3p and \7A\ . we obtain 



(7.5) 

By definition of we have 

W/,wf) ^ f\ {S^tPi, Rstw^ -tDp^/) ds , 
Jo 



where 

Hs..PiR..^^-tDpr) = [ '^-":rA^^"^ 7 



Hpk ) Jr'' h{pf; ) jRd h{p^K: ) 

JB<i h{pK ) ) 



and hence, 



^^{pt:\w:j')<£^iSstPlR,tW^)-2t I I (7.6) 



jR'i /i(Pk ) 

Lemma 7.3. // (I7.2p holds, then we have 

f I s r^'* ^7 - ^(§tM + ^7 I 7) - ^(^ + ^7 I 7) ■ (7.7) 

Proof. Recall that p^'* = SstP%, with p^ e i^(M''). Then, acting as in the proof 
of Proposition |4?2l we get that 

lE^lfL ds < oo Vr>0. (7.8) 
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The assumption (17. 2p on h then leads to 



s,t djds<— I I djds <oo 



The next step consists in proving that 

|2 



Note that p^'* > k and the concavity of h imphes that 
hence 



(7.9) 



jj — d7 ds < —— / / , , d-f ds 



h{K) 



<^/ / ^^d^ds<oo 



since the KFP flow decreases the action. The bound (|7.9p immediately follows from 
the previous one and (|7.8p . As a consequence, we can apply the chain rule in Wasser- 
stein space, which implies that the function s ^(jO*-*) is absolutely continuous on 
[r, 1] and, for aU s e [r, 1], 

i I ^7 = / . ^ = / . .7.10) 

Integrating (I7.10p on [r, 1] and letting r go to finally leads to (|7.7I) . □ 

Let us go back to the proof of Theorem 17.21 We put (|7.5p and (|7.6p together and 
obtain 

\ [Wl^i^tP + K7, + «7) - ct)) 

< \ ['^^{SstPl, Rstw^) - <^M,^n] + t [*(^ + '^T I 7) - n^tP + I 7)] + f • 
We then use the main estimate in Theorem 16. II with (3 = and (17. 4p to write 

i[^*(5,tp^„i2,iti;^)-^<,(p^,ti;^)] < -l/,(t)A>(p^,t«^) 

<-l hit) Wl^iStP + Kj,a + K7) , 
where h{t) := (l - e^^Ai^) it follows that 

I [Wl^i^tp + ni,<J + K7) - Wl^ip, a)] + i h{t) Wl^iStP + K1,<J + K7) 

< t [*((T + K7 I 7) - *(StM + «:7 I 7)] + I • 
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If we first let e and then k go to in the above estimate, we get that 



- [Wl^($t^^, a) - Wl^{^l, a)] + - hit) Wl^{$t^l, a) < t [^{a \ 7) - *(StA^ | 7)] 

(7.11) 

as soon as h satisfies the assumption (|7.2p . Now, any concave and non decreas- 
ing function h can be decreasingly approched by a sequence (ft,„) satisfying (|7.2p . 
and the corresponding entropies converge increasingly. Then, with Theorem 13.71 In- 
equality (|7.1ip turns out to be valid for any general h. To complete the proof of 
Theorem 1 7. 2 [ it just remains to divide (|7.1ip by t and let t go to 0. □ 
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